INDEXING AND MAPPING OF PROTEINS USING A MODIFIED NONLINEAR SAMMON PROJECTION Nonlinear Sammon Projection of Compositional Space of Proteins can Predict Protein Folding Classes

نویسندگان

  • Izydor Apostol
  • Wojciech Szpankowski
چکیده

A modified Sammon's algorithm was applied to display a relationship between proteins based on their amino acid composition. In the first stage of the method the 19-dimensional compositional space of representative proteins was mapped into 2-dimensinal space using the original Sammon projection to create a contour map. In the second stage, the contour map was used as a reference for newly projected proteins. Data analysis showed that proteins belonging to the same structural class form characteristic and distinct clusters which can be utilized in prediction of structural classes. However, significant overlapping of the clusters has been observed which may explain the limited success of previous protein folding predictions based solely on amino acid composition. Additionally, the modified Sammon's projections can generate a unique index for each individually projected protein related to its amino acid composition which can be a useful parameter in classification of proteins.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Indexing and mapping of proteins using a modified nonlinear Sammon projection

A modified Sammon algorithm was developed to display a relationship between proteins based on their amino acid composition. In the first stage of the method, a 19-dimensional compositional space of representative proteins was mapped into a 2-dimensional space (2-D) using the original Sammon projection creating a contour map. In the second stage, this contour map was used as a reference for new ...

متن کامل

Appendix 2.4 Stopping Rule 2.3 Fine Tuning Using the Basic Lvq1 or Lvq2.1 Lvq Pak: a Program Package for the Correct Application of Learning Vector Quantization Algorithms

The program package is available at the Internet site "cochlea.hut." and will be updated continuously. The indexing of the latest release will always be of the form "lvq pak-X.Y". The instructions below are indexed as in the version lvq pak-1.1, which was released on December 31, 1991. The programs are available in two archive formats, one for the UNIX-environment, the other for MS-DOS, respect...

متن کامل

A reindexing based approach towards mapping of DAG with affine schedules onto parallel embedded systems

We address the problem of optimally mapping uniform DAGs to systolic arrays, given an affine timing function. We introduce an automatic allocation method based on a preprocessing by reindexing that transforms the initial DAG into a new one that enables the well known projection method to minimize the number of processors along a number of directions. We demonstrate its superiority to other meth...

متن کامل

Nonlinear Approximate Indexing for Multimedia Data

This paper presents a new nonlinear approximate indexing method for highdimensional data such as multimedia data. The new indexing method is designed for approximate similarity searches and all the work is performed in the transformed Gaussian space. In this indexing method, we first map the input space into a feature space via the Gaussian mapping, and then compute the top eigenvectors in the ...

متن کامل

Multidimensional Mapping and Indexing of XML

We propose a multidimensional approach to store XML data in relational database systems. In contrast to other efforts we suggest a solution to the problem using established database technology. We present a multidimensional mapping scheme for XML and also thoroughly study the impact of established and commercially available multidimensional index structures (compound B-Trees and UB-Trees) on th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999